CDS

Accession Number TCMCG024C28909
gbkey CDS
Protein Id XP_022005196.1
Location join(128429557..128429704,128431451..128431690,128432085..128432231,128432544..128432622,128433031..128433214,128433300..128433365,128433470..128433688,128434846..128435052,128435132..128435435,128435756..128435915,128436028..128436272,128437016..128437071,128437163..128437462,128437586..128437705,128437817..128438485,128438592..128438756)
Gene LOC110903724
GeneID 110903724
Organism Helianthus annuus

Protein

Length 1102aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022149504.2
Definition beta-galactosidase [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyl hydrolase 2 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01105        [VIEW IN KEGG]
R01678        [VIEW IN KEGG]
R03355        [VIEW IN KEGG]
R04783        [VIEW IN KEGG]
R06114        [VIEW IN KEGG]
KEGG_rclass RC00049        [VIEW IN KEGG]
RC00452        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01190        [VIEW IN KEGG]
EC 3.2.1.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00052        [VIEW IN KEGG]
ko00511        [VIEW IN KEGG]
ko00600        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00052        [VIEW IN KEGG]
map00511        [VIEW IN KEGG]
map00600        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTGCTTCTGCAACTATGGTTGTGAGTAACAAGTTGATTGTGAATTCAAGTAATGGGGATTATAAGGTGTGGGAAGATCCATCGTTTATCAAATGGAGGAAGAGGGATTCTCATGTTTCACTCCATTGTCATGATTCTGTTGAAGGTTCTCTAAAGTACTGGTATGAACGAAACAAAGTGGACGTTCTTGCTGCAAAATCAGCAGTTTGGGACGATGATGCTGTTTCCGGATCAATTGAATGTGCCAAACATTGGGTTAAAGATATGCCTTTTGTTAAATCGTTATCCGGTTACTGGAAATTCTTCCTTGCACAAAGTCCTACCGCTGCTCCTTCAAATTTCCAAGATACAGCGTTTCAAGACTCTACATGGGAAACAATACCAGTACCATCAAATTGGCAGATGCATGGGTTTGATCGACCCATATATACAAATATAATATATCCGTTTCCACTTGACCCACCCCGTGTTCCCGATGATAATCCCACTGGCTGCTACAGGACATATTTTCAACTTCCAAAAGATTGGGAAGGTCGACGGGTATTACTACACTTTGAAGCTGTTGATTCTGCTTTCCACGTGTGGATCAACGGGTCTCTTGTTGGATACAGTCAAGACAGTAGACTGCCGGCTGAATTTGAAATCACCGATTTCTGTCATGAGTGTGGATCTGACAAGAAGAATGTTATAGCTGTGCAAGTATATCGATGGAGTGATGGTTCTTATCTTGAAGATCAAGATCATTGGTGGTTGTCTGGTATTCATCGAGACGTGCTTCTGTTATCCAAACCAAAGGTATGCATAGCAGACTATTTCTTCACTTCGAATCTAGTGGAAGATTATTCCTATGCAGACCTCGAGGTTGAAGTAATACTTGACAAGTCAACGGAGGTCAATGTCAATAAAGATGTCAAAATTGAAGTCACACTGTTCGATATTAGCGGTAACGAGTGTACTGATCTTCTATCGACTGATGTGGCACGTTTAGAGCTTCATCCCCCTCCTAGAATGCCTTTAGGGTTTCACGGATATCGACTAACTGGAAAACTGAAAAATCCCAAGCTTTGGTCTGCAGAGCAACCAAATCTTTATACTTTAGTAGTCACCCTGAAAGATGCATCGGGTAATATCGTCGACTGTGAATCATGTCAAGTGGGCATTCGGAAGATTTCAAAAGCCCCGAAACAGTTACTTGTTAATGGGCATCCAGTTATGATCAGAGGGGTAAACAGGCATGAACACCATCCACGTATAGGAAAGACGAACATTGAATCTTGCATGGTTAAGGATTTGGTTTTAATGAAAGAACATAATATAAATGCTGTTAGAAACAGTCATTATCCTCAACATCCAAGATGGTACGAGTTATGCGATTTGTTTGGAATGTACATGATAGACGAGGCCAATATCGAGACACACGGTTTTGATCTTTCTCACCATGTCAAGCATCCAACTCAAGAACCGATTTGGGCCTCGGCTATGTTGGATCGCGTTATTGGCATGGTGGAAAGGGACAAAAACCACGCATGCATTATTTCTTGGTCTCTCGGAAATGAAGCAAGCTATGGACCAAATCATGCTGCTCTTGCTGGTTGGATTCGTGGAAAGGATCCTTCCCGAGTTATACACTACGAAGGTGGTGGGTCTCGGACCCCATCAACAGACATTGTATGTCCTATGTACATGCGTATCTGGGACTGTGTTAAGATAGCAAAAGATCCAACCGAAACGAGACCGCTAATATTGTGCGAGTATTCGCATGCCATGGGCAATAGCAACGGGAACATTCATGAATACTGGGAAGCCATTGATAGCACATTTGGTCTCCAAGGAGGATTTATATGGGATTGGGCTGACCAGGGGCTACTCAAAGAAAGTAGTGACGGTAGCAAGTTCTGGGCTTATGGCGGTGACTTTGGAGATACCCCTAATGATTTGAATTTCTGCATGAATGGTCTCGTATGGCCCGATCGGACCCCTCATCCTGCACTAAATGAGGTCAAGTATTGCTATCAACCAATTAAAGTGTCATTCACCGATGGCTTATTTAAGATCACAAACACCAATTTCTTTCAAACAACCGAAGGGGTAGAGTTTAGTTGGGTGATTGAAGGTGATGGATGTAAGCTTGAGTCGGGAAGTCTCAATCTACCGATGTTAGATCCACAAAGCAGTTACGATATCAAATGGGAATCCAGCCCGTGGTATCCATCATGGGCCTCGTCCTCTGCTGCCGAAACCTTTTTGACTATTACCGCAACTCTTTCTAAGCCCACACGATGGCTTCAATCTGGTCATGTTGTGTCGACTCAACAAATCGAGTTACCTTCAAAAAAAGACTTCATCTCCCCTGCCCCAAAGGTTAAAAAAGTCGCATTGAATTATGAAATCATAGACCATAAACTTACCATCCGACATAACGCCTCAGAGATAACGTTCGACAATGAGTCTGGTGCGATTGAAAGCTGGACGGTCGAAGGAGTTCCCGTGATGCGTAAGGGCATAACACCATGCTTTTGGCGTGCACCTACCGACAATGACAAAGGAGGAGAAGACAACAGTTATCTCTCAAAATGGAAAGCCGCGAATCTCGATAACGTTGTCTTCGTTAAAGAGAGCTCAAATGTTAAGAAGATCACAGACCAGCTACTAGAAGTAACCGTCGTGTTTAACGGTTTTTCAAAGGGTGGTGAAAACGAAAACCCTCTTTTCAAAGTCGACATGAAATACTCATTCTACGGTTCTGGAGACGTTATTTTGGTTAGCCATGTGAAACCAAGATCAGATCTTCCACCTTTGCCACGTGTTGGGGTCGAATTCCATTTGGAGAAGTCGATAAATAATGTTAAGTGGTATGGAAGAGGCCCGTTTGAATGTTATCCGGATCGAAAAGCGGCTGCGCACGTGGGGTCGTATGAGAAAAAGGTGGATGAGATGCATGTTCCTTATATAGTTCCGGGAGAATGTGCGGGCCGAGCTGATGTTAGATGGGTTGCATTCCAAAATGATCAAGGCTCAGGCATCTATGCTTCCGTTTATAATGACTCTCCACCGATGCAAATGAATGCAAGTTATTATAGCACAACGGAGCTTGATCGTGCAACACGCAAGGAAGAACTTGTGAAGGGAGATGACATTGAGGTGCATCTTGACCACAAGCATATGGGCATAGGTGGCGATGACAGTTGGTCTCCCGCGGTTCATGACAAATACATGATTCCACCTTCACCATGCACTTTCTCCATCAGGTTCTGTCCGATAACTGCTGCTACATCTCCCCATGATATCTATAAGGCTCGGTCTTGA
Protein:  
MAASATMVVSNKLIVNSSNGDYKVWEDPSFIKWRKRDSHVSLHCHDSVEGSLKYWYERNKVDVLAAKSAVWDDDAVSGSIECAKHWVKDMPFVKSLSGYWKFFLAQSPTAAPSNFQDTAFQDSTWETIPVPSNWQMHGFDRPIYTNIIYPFPLDPPRVPDDNPTGCYRTYFQLPKDWEGRRVLLHFEAVDSAFHVWINGSLVGYSQDSRLPAEFEITDFCHECGSDKKNVIAVQVYRWSDGSYLEDQDHWWLSGIHRDVLLLSKPKVCIADYFFTSNLVEDYSYADLEVEVILDKSTEVNVNKDVKIEVTLFDISGNECTDLLSTDVARLELHPPPRMPLGFHGYRLTGKLKNPKLWSAEQPNLYTLVVTLKDASGNIVDCESCQVGIRKISKAPKQLLVNGHPVMIRGVNRHEHHPRIGKTNIESCMVKDLVLMKEHNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSHHVKHPTQEPIWASAMLDRVIGMVERDKNHACIISWSLGNEASYGPNHAALAGWIRGKDPSRVIHYEGGGSRTPSTDIVCPMYMRIWDCVKIAKDPTETRPLILCEYSHAMGNSNGNIHEYWEAIDSTFGLQGGFIWDWADQGLLKESSDGSKFWAYGGDFGDTPNDLNFCMNGLVWPDRTPHPALNEVKYCYQPIKVSFTDGLFKITNTNFFQTTEGVEFSWVIEGDGCKLESGSLNLPMLDPQSSYDIKWESSPWYPSWASSSAAETFLTITATLSKPTRWLQSGHVVSTQQIELPSKKDFISPAPKVKKVALNYEIIDHKLTIRHNASEITFDNESGAIESWTVEGVPVMRKGITPCFWRAPTDNDKGGEDNSYLSKWKAANLDNVVFVKESSNVKKITDQLLEVTVVFNGFSKGGENENPLFKVDMKYSFYGSGDVILVSHVKPRSDLPPLPRVGVEFHLEKSINNVKWYGRGPFECYPDRKAAAHVGSYEKKVDEMHVPYIVPGECAGRADVRWVAFQNDQGSGIYASVYNDSPPMQMNASYYSTTELDRATRKEELVKGDDIEVHLDHKHMGIGGDDSWSPAVHDKYMIPPSPCTFSIRFCPITAATSPHDIYKARS